Design and Evaluation of a Subblock Cache Coherence Protocol for Bus-Based Multiprocessors
Abstract
Parallel applications exhibit a wide variety of memory reference patterns. Designing a memory architecture that serves all applications well is not easy. However, because tolerating or reducing memory latency is a priority in effective parallel processing, it is important to explore new techniques to reduce memory traffic. In this paper, we describe a snoopy cache coherence protocol that uses a large transfer block (to take advantage of spatial locality) while using a small coherence block in order to avoid false sharing. To further illustrate the protocol, we present an example of its workings. We then present the results of simulating our protocol on 5 applications that exhibit a variety of reference patterns. We find that our protocol effectively takes advantage of spatial locality while avoiding the increase in false sharing that often occurs when using large line sizes.
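The split between a large transfer block and a small coherence block can be illustrated with a minimal sketch. This is not the paper's protocol: the class name `CacheLine`, the method names, the constants, and the two-state (valid/invalid) subblock model are all assumptions chosen for illustration. The sketch only shows that a snooped remote write invalidates the one subblock it touches, so the rest of the line stays usable.

```python
from dataclasses import dataclass, field

SUBBLOCKS_PER_LINE = 4   # assumed: one transfer block holds 4 coherence subblocks
WORDS_PER_SUBBLOCK = 2   # assumed subblock size, in words


@dataclass
class CacheLine:
    """One transfer block whose subblocks carry independent valid bits."""
    tag: int
    valid: list = field(default_factory=lambda: [True] * SUBBLOCKS_PER_LINE)

    def snoop_remote_write(self, word_offset: int) -> None:
        # Invalidate only the subblock containing the written word.
        self.valid[word_offset // WORDS_PER_SUBBLOCK] = False

    def hit(self, word_offset: int) -> bool:
        # A reference hits only if its own subblock is still valid.
        return self.valid[word_offset // WORDS_PER_SUBBLOCK]


line = CacheLine(tag=0x40)        # whole line fetched at once (spatial locality)
line.snoop_remote_write(5)        # another processor writes word 5 (subblock 2)
print(line.hit(0), line.hit(5))   # word 0 still hits; word 5 now misses
```

With a single valid bit per line, the same remote write would have invalidated word 0 as well, which is the false-sharing miss the small coherence block is meant to avoid.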
Similar resources
Minerva: An Adaptive Subblock Coherence Protocol for Improved SMP Performance
The major limitation on the performance of shared memory multiprocessors running parallel programs is the memory traffic due to sharing, i.e., the coherence or consistency induced memory traffic. Much of this traffic occurs due to false sharing (when two or more processors use disjoint portions of the same cache block) and dead sharing (the transfer of unreferenced words in a block when the block mov...
On the effectiveness of sectored caches in reducing false sharing misses
In this paper we study the effectiveness of sectored caches in reducing false sharing misses on bus-based multiprocessors. In a sectored cache, each cache line is divided into several subblocks. A subblock is a basic coherence unit. When false sharing occurs, the involved cache line need not be invalidated or transferred, as long as the corresponding subblocks are kept coherent. To facilitate ...
A performance evaluation of cache injection in bus-based shared memory multiprocessors
Bus-based shared memory multiprocessors with private caches and snooping write-invalidate cache coherence protocols are the dominant form of small- to medium-scale parallel machines today. In these systems the high memory latency poses the major hurdle in achieving high performance. One way to cope with this problem is to use various techniques for tolerating high memory latency. Software-controlled ...
Design of Large-Scale Symmetric Multiprocessors (SMPs) using Parallel Optical Interconnects
In this paper, we address the primary limitation of bandwidth demands for address transactions in future cache-coherent symmetric multiprocessors (SMPs). As a solution, we propose a scalable optical address sub-network called Symmetric Multiprocessor Network (SYMNET). SYMNET not only has the ability to pipeline address requests, but also multiple address requests from different processors can p...
SYMNET: an optical interconnection network for scalable high-performance symmetric multiprocessors.
We address the primary limitation of the bandwidth to satisfy the demands for address transactions in future cache-coherent symmetric multiprocessors (SMPs). It is widely known that the bus speed and the coherence overhead limit the snoop/address bandwidth needed to broadcast address transactions to all processors. As a solution, we propose a scalable address subnetwork called symmetric multipr...
Publication date: 1994